173 results found.
Language Type:
Multilingual
Languages:
American English
Availability:
Freely Available
License:
Apache License, Version 2.0
Size:
2.1
Production Status:
Newly created-finished
Use:
Document Classification, Text categorisation
- Paper title: Locating Requests among Open Source Software Communication Messages
- Paper track: Evaluation
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country | Affiliation 2 | Country |
|---|---|---|---|---|---|
| Author 1 | Yannis Korkontzelos | Department of Computing, Edge Hill University | GB | National Centre for Text Mining, The University of Manchester | GB |
| Author 2 | Sophia Ananiadou | University of Manchester | GB | | |
| Main Contact | Yannis Korkontzelos | Department of Computing, Edge Hill University | None | | |
Documentation:
<Not Specified>
Written
Corpus
Language Type:
Multilingual
Languages:
American English
Availability:
Freely Available
License:
<Not Specified>
Size:
855 sentences
Production Status:
Existing-used
Use:
Language Modelling
- Paper title: Corpus Annotation as a Scientific Task
- Paper track: Evaluation
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Donia Scott | University of Sussex | None |
| Author 2 | Rossano Barone | University of Sussex | None |
| Author 3 | Rob Koeling | University of Sussex | None |
| Main Contact | Donia Scott | University of Sussex | GB |
Documentation:
<Not Specified>
Language Type:
Multilingual
Languages:
American English, Mandarin Chinese
Availability:
From Data Center(s)
License:
<Not Specified>
Size:
18000 sentences
Production Status:
Newly created-in progress
Use:
Machine Translation, SpeechToSpeech Translation
- Paper title: Building a Hierarchically Aligned Chinese-English Parallel Treebank
- Paper track: Resources
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Dun Deng | Brandeis University | US |
| Author 2 | Nianwen Xue | Brandeis University | US |
| Main Contact | Dun Deng | Brandeis University | None |
Documentation:
English guidelines in progress
Speech/Written
Corpus
Language Type:
Multilingual
Languages:
American English
Availability:
Freely Available
License:
CreativeCommons
Size:
207 hours
Production Status:
Existing-updated-(release this year)
Use:
Speech Recognition/Understanding
- Paper title: Enhancing the TED-LIUM Corpus with Selected Data for Language Modeling and More TED Talks
- Paper track: Speech
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country | Affiliation 2 | Country |
|---|---|---|---|---|---|
| Author 1 | Anthony Rousseau | LIUM | FR | | |
| Author 2 | Paul Deléglise | LIUM | FR | Université du Maine, LIUM | None |
| Author 3 | Yannick Estève | LIUM | FR | Université du Maine, LIUM | None |
| Main Contact | Anthony Rousseau | LIUM | None | | |
Documentation:
<Not Specified>
Written
Lexicon
Language Type:
Trilingual
Languages:
American English, Catalan, Spanish
Availability:
Freely Available
License:
OpenSource
Size:
263620 <Not Specified>
Production Status:
Existing-updated
Use:
Word Sense Disambiguation
- Paper title: Multilingual Central Repository version 3.0
- Paper track: Written
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country | Affiliation 2 | Country | Affiliation 3 | Country | Affiliation 4 | Country |
|---|---|---|---|---|---|---|---|---|---|
| Author 1 | Aitor Gonzalez-Agirre | Euskal Herriko Unibertsitatea, Informatika Fakultatea | None | | | | | | |
| Author 2 | Egoitz Laparra | Euskal Herriko Unibertsitatea, Informatika Fakultatea | None | | | | | | |
| Author 3 | German Rigau | <Not Specified> | None | University of the Basque Country | None | IXA NLP Research Group | None | Euskal Herriko Unibertsitatea, Informatika Fakultatea | None |
| Main Contact | Aitor Gonzalez-Agirre | IXA Group, UPV/EHU | ES | | | | | | |
Documentation:
http://adimen.si.ehu.es/web/MCR
Language Type:
Multilingual
Languages:
American English
Availability:
<Not Specified>
License:
<Not Specified>
Size:
7 <Not Specified>
Production Status:
Newly created-finished
Use:
<Not Specified>
- Paper title: Supervised Topical Key Phrase Extraction of News Stories using Crowdsourcing, Light Filtering and Co-reference Normalization
- Paper track: Terminology
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country | Affiliation 2 | Country | Affiliation 3 | Country |
|---|---|---|---|---|---|---|---|
| Author 1 | Luís Marujo | <Not Specified> | None | LTI/CMU and INESC‐ID/IST | None | | |
| Author 2 | Anatole Gershman | <Not Specified> | None | LTI/CMU | US | | |
| Author 3 | Jaime Carbonell | <Not Specified> | None | LTI/CMU | US | Carnegie Mellon University | N/A |
| Author 4 | Robert Frederking | <Not Specified> | None | | | | |
| Author 5 | João P. Neto | <Not Specified> | None | | | | |
| Main Contact | Luís Marujo | LTI/CMU and INESC-ID/IST | PT | LTI/CMU and INESC‐ID/IST | PT | | |
Documentation:
<Not Specified>
Language Type:
Multilingual
Languages:
American English, English
Availability:
From Owner
License:
<Not Specified>
Size:
100 MByte
Production Status:
Newly created-in progress
Use:
Dialogue
- Paper title: An Annotated Corpus of Film Dialogue for Learning and Characterizing Character Style
- Paper track: Evaluation
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country | Affiliation 2 | Country |
|---|---|---|---|---|---|
| Author 1 | Marilyn Walker | <Not Specified> | None | UCSC UARC | None |
| Author 2 | Grace Lin | <Not Specified> | None | | |
| Author 3 | Jennifer Sawyer | <Not Specified> | None | | |
| Main Contact | Marilyn Walker | UCSC | US | | |
Documentation:
<Not Specified>
Written
Terminology
Language Type:
Trilingual
Languages:
American English, German, Spanish
Availability:
Freely Available
License:
OpenSource
Size:
3500 <Not Specified>
Production Status:
Existing-updated
Use:
knowledge acquisition and text production in the environmental domain
- Paper title: Linguistic knowledge for specialized text production
- Paper track: Terminology
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country | Affiliation 2 | Country |
|---|---|---|---|---|---|
| Author 1 | MIRIAM BUENDÍA-CASTRO | <Not Specified> | None | | |
| Author 2 | Beatriz Sánchez-Cárdenas | <Not Specified> | None | LEXICON, University of Granada | ES |
| Main Contact | MIRIAM BUENDÍA-CASTRO | University of Granada | ES | | |
Documentation:
http://ecolexicon.ugr.es/en/aboutecolexicon.htm
Written
Tagger/Parser
Language Type:
Trilingual
Languages:
American English, Spanish, French
Availability:
From Owner
License:
proprietary
Size:
500 MByte
Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
- Paper title: Facing the Identification Problem in Language-Related Scientific Data Analysis.
- Paper track: Infrastructural Issues/Large Projects
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country | Affiliation 2 | Country |
|---|---|---|---|---|---|
| Author 1 | Joseph Mariani | LIMSI-CNRS | FR | LIMSI-CNRS & IMMI | FR |
| Author 2 | Christopher Cieri | LDC | US | | |
| Author 3 | Gil Francopoulo | Tagmatica + IMMI-CNRS | FR | | |
| Author 4 | Patrick Paroubek | LIMSI-CNRS | FR | | |
| Author 5 | Marine Delaborde | LIMSI-CNRS | FR | | |
| Main Contact | Joseph Mariani | LIMSI-CNRS | None | | |
Documentation:
<Not Specified>
Written
Tagger/Parser
Language Type:
Multilingual
Languages:
American English
Availability:
Freely Available
License:
Illinois Open Source License
Size:
34 <Not Specified>
Production Status:
Existing-used
Use:
NLP Tool provided as service
- Paper title: An NLP Curator (or: How I Learned to Stop Worrying and Love NLP Pipelines)
- Paper track: General issues
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | James Clarke | University of Illinois (Urbana-Champaign) | None |
| Author 2 | Vivek Srikumar | University of Illinois (Urbana-Champaign) | None |
| Author 3 | Mark Sammons | University of Illinois (Urbana-Champaign) | None |
| Author 4 | Dan Roth | University of Illinois (Urbana-Champaign) | None |
| Main Contact | Mark Sammons | University of Illinois | US |
Documentation:
http://cogcomp.cs.illinois.edu/page/download_view/13 (overview) and README in source jar, English




